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SERK (Receptor Kinase SEQ ID NO: 21) 
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Secretion signal underlined 
■■ Leucine rich region 
Hi Proline box 
HH Transmembrane domain 



J Protein kinase domain 

■ Subdomain I: Glycine triad 

■ Subdomain II: Invariant lysine 
-* Subdomain VTb: Catalytic loop 

■ Subdomain Vn/VDI: Activation loop bounded by invariant DFG and APE motifs | 
I Subdomain IX: Invariant d and g 



" Motifs & Modules 
\ 

Peptide Motifs and Protein Modules in Cell 

Signalling 

A great leap in the understanding of cellular signal transduction 
pathways came with the realisation that... 

♦ certain linear amino acid sequences (or "motifs") 

* as well as certain 3-dimensionaI folded domains (or 
"modules") 

,..are contained within the structures of (often unrelated) 
diverse proteins involved in signalling. Although a few of these 
motifs are found in proteins not involved in signalling, many 
are unique to signalling molecules. 

Modules are tightly folded discrete structures, many of which 
can be inserted into unrelated proteins during evolution, 
without effect on the overall structure/function of the acceptor 
protein. SH~2 and SH-3 domains are examples of modules 
found in many unrelated types of proteins involved in signal 
transduction. 

Searching protein databases for the presence of such motifs 
and modules allows identification of signalling functions in 
previously uncharacterised sequences. 

1. Protein Kinases 



Definitions... 

Kinase:* an enzyme which catalyses the phosphorylation of an 
acceptor molecule, with ATP (usually) acting as the phosphate 
(phosphoryl) donor. You will be familiar with kinases in glycolysis 
which transfer phosphate to carbohydrates - e.g. hexokinase. 
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Protein kinases:- transfer phosphate to specific proteins* The 
> phosphate either tags the protein or alters its subsequent activity. 

There are basically two types of protein kinases* 

(a) . Serine/threonine protein kinases - which phosphorylate 
either serine or 

threonine 

(b) . Tyrosine protein kinases - which phosphorylate tyrosines 

Members of the tyrosine protein kinase family may be either 
receptor tyrosine kinases or non-receptor tyrosine kinases. 

Both Ser/Thr- and Tyr kinases share a homologous stretch of 
approximately 300 amino acids which represents the core 
catalytic site 

We shall use the insulin receptor as an example since it 
contains not only a tyrosine kinase domain, but also many 
other motifs and modules found in signal transduction 
molecules* The insulin receptor can be thought of as a dual- 
functional protein containing an extracellular recognition site for 
insulin binding and an intracellular catalytic site which 
phosphorylates tyrosines. 



Figure 1.1. shows the complete human insulin receptor 
sequence. Note that numbering varies between papers 
depending on whether the signal sequence and/or the splice 
variant region are counted. 



Figure 1.1. insulin receptor sequence 

slcma2 jpeptidt? 27 1 

mtitr* sJjt&s-GbsjJt 1 4 

HLY PGEVCPGMBI RNHLTRLKEL EKCSTIEGHL 
34 61 QILLttFKTKF EDFRDLSFFK LHHTDYIXL FRYYGLE5LK DLFFStTVIR (xSKLFFHY&L 
94 121 7IFEOTHLKE LGLYHLHHIT EJ&SmEKHN ELCYIATOT SRILDS7EDH HIVLMKDDHE 
154 181 EC&DXCF&T& HBKTHCFA.TV IBGQFYEFCff THSHCQK7CP TICKSHBCTA E&LCCHSECL 
214 241 GHCSQPBDPT KCVACBNFYL D&RC7ETCPF FYYHFQDWKC YHFSFCQDLH HKCKffSRRQG 
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334 361 T7IHGSLIIK IB6&mAAE LEAHLGUEE IS&YLKCRRS YALTSLSFFR KLRLIRSETL 

394 421 EIGHYSFYAL DHQNIRQL" WSKHKLTTTQ SKLFFHYBPK LCLSEIHKME ETSGTKGRQE 

454 481 RMBIALKTH& DKASCENELi KFSYIRTSFD KTLLIWEFW PPBFRDUGF ttLFYKEAPYQ 

514 541 H7TEFB6QDA C&SHSffT77D IDPFUSHBP KSQBHPGWLM KSLKPTOYA IF7KTLTTFS 

574 601 DEERTYGAKS DIIY7QTDAT HPS7PLDPIS 7SHSSSQIIL KffKPPSDPHB HITHYLVFffE 

634 661 RQAEDSELFE LDYCLH&LKL PSIOTSFPFE SEDSQKHKQS EYEDSA&ECC SCPKTBSQIL 

664 721 KELEESSFRK TFEDYLHHTV F7PRKTSSGT GAEDPRPSBK PR 

736 

mttve J*ts~cJt82Ji Simm T7AVPTVAAF 

754 781 PHTSSTSVPT SFEEHRPFEK HHKESLVIS &LRHFT&YRI ELQACBQDTP EERCSV&&YT 
814 841 SARTMPEAKA DBIV&PTTHE IFEHHT7HUI ffQEPKEPHGL ITtYEYSYRR Y&DEELHLC7 
S74 901 SRKKFAIERG CP1RGLSPGN YSVRIEATSL AfflD&STOPT YFYVTDYLD7 PSKIA flniSl 
934 961 IPLIF7FIFS7 7IGSIYIFL1 R KRQPDGFL&P LUSSHPEYL S&SD7FPCS7 YYPDE1TCYSR 
994 1021 EKITLLRELG QGSFGM7YEG HARDIIKGEA ETK7AYKTYN ES&SLRERIE FLKEASTO& 
1054 1081 FTCBH77RLL ©77SKBQPTL YYHELM&H&D LKSYLRSLRP EftEHUFGRPP PTLQEtUQtft 
1114 1141 AEIADSMAYL NAKKF7 HRSL AARffCOTAHD FTVKT&DFGM TRDIYETDYY RK5GKOLLP7 
1174 1201 KOT1APESLKD G7FTTSSBI57 SFG77LWEIT SLAEQPYQGL SHEQ7LKF7M BGGYLDQPDH 
1234 1261 CPER7TDLMR tOTQFBPKMR PTFLEI7KLL KDDLHPSFPE 7SFFHSEEHK APESEELEHE 
1294 1321 FEBMEN7PI.D RSSHCQREE& GORBGOSSL& FKRSYEEHIP YTHHHBGKHff &PILTLPRSH 
1354 1381 PS 

l_ , , . 

grey = numbering oi pro-form (before processing) 

-27-0:- Signal peptide (cleaved ofi during ER 1-27 
1-735:- Mature o chain (ligand-binding , extracellular) 28—758 
718-729:- Splice variant region, (missing in short isoiorm) 745—756 

736-1355:- Mature p-cbain (catalytic and regulatory, cytosolic) 763-1382 

Transmembrane domain (BOXED) 

Catalytic domain 

i 

1003-1011:- Rossroann motii ftri-glvcvl + Ivs^ P-ancbor) 



1150-1179 



1130-M39 



- Activation segment f= activation* & F+l loops V 



- Catalytic loop 



1.2. Structural and functional features shared by all protein 
kinase enzymes 

The work of Steven Hanks led to the recognition that all protein kinases 
have conserved residues and homologous stretches centred on 12 sub- 
domains within the approximately 300 amino acid kinase stretch (see 
Figure 1.2.) 

Figure 1.2. ALIGNMENTS OF PROTEIN KINASE CATALYTIC SITES - SUB-DOMAIN ASSIGNMENTS ACCORDING TO 

Hanks (1988) Science 

I II III 

43 64 65 83 98 

PKA FERKKTLGTGSFGRVMLVKHKA 1 TEQYYAMKILDKQKVVKLKI QIEHTLNEKRILQAV 

PKC FNFLMVLGKGSFGKVMLSERKG 1 TDELYAVKILKKDWIQDD | DVECTMVEKRVLALPG 

Src L RLEVKLGQGC FGE VWMGTWNG 1 -TTRVAI KTLKPGTM 1 SPEAFLQEAQVMKKL 

IR ITLLRELGOGSFGMVYEGNARDIIKGE j AETRVAVKTVNESASLR— I ERIEFLNEASVMKGF 

IV V 

99 113 137 

PKA NFPFLVRLEYAFKDN | SNLYMVME WPGGEMFSHLRRI GR 

PKC KPPFLTQLHSCFQTM | DRLYFVMEYVNGGDLMYHI QQVGR 

Src RHEKLVQLYAWSE- 1 EPIYIVTEYMSKGSLLDFLKGETGKY 
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IR TCHHWRLLGWSKGI QPTLWMELMAHGDLKSYLRSLRPEAENNPGRPP 

Via VIb VII 

138 160 178 195 

PKA FSEPHARFYAAQIVLTFEYLHSL | DLI YRDLKPENLLIDHQG | YIQVTDFGFAKRVKGRT 

PKC FKEPHAVFYAAEIAIGLFFLQSKf GIIYRDLKLDNVMLDSEGI HIKIADFGMCKENIWDGVTT 

Src LRLPQLVDMAAQIASGMAYVERMi NYVHRDLRAANI LVGENL I VCKVADFGLARLI EDNE YTAR 

I R PTLQEMI QMAAEI ADGMAYLNAK | KFVHRDLAARNCMVAHDF | TVKI GDFGMTRDI YETDYYRKG 

VIII IX X 

196 210 240 260 

PKA WTLCGTPEYIAPEI 1 1 LS KGYNKAVDWWALGVLI YEMAA-GYP P F FA i DQPIQIYEKIVSG-KVRFPSH 
PKC KTFCGTPDYIAPEI 1 1 AYQPYGKSVDWWAFGVLLYEMLA-GQAPFEGf EDEDELFQSIMEH-NVAYPKS 
Src QGAKFPI KWTAPEAA | LYGRF T I KS DVWS FGI LL TE LT T KGRVPYPG | MVNREVLDQVE RGYRMPC PPE 
I R GKGLLPVRWMAPESL 1 KDGVFTTSSDMWSFGWLWEI TSLAEQPYQG | LSNEQVLKFVMDGGYLDQPDN 
XI 



261 297 



PKA FSSDLKD-LLRNLLQVDLTKRFGNLKNGVSDI KTHKWF 
PKC MSKEAVA-ICKGLMTKHPGKRLGCGPEGERDIKEHAFF 

Src CPESLHD-LMCQCWRKEPEERPTFEYL QAFL 

IR CPERVTD-LMRMCWQFNPKMRPTFLEIVNLL KDDL 



PKA= cAMP-dependent protein kinase p-type catalytic sub-unit (from amino acid 43) 

PKC= Protein kinase C pi (from amino acid 339) 

Src= Non-receptor protein tyrosine kinase (from amino acid 267) 



IR— Insulin recptor (from amino acid 996) 



See Steven Hanks Web site 

1.3. Catalytic Domains of Protein Kinases 

Not surprisingly, many of the conserved residues were found to have 
essential roles to play in catalysis. Of particular importance are three 
loops:- the 'P-loop* (sub-domain I); the 'C-loop' (sub-domain VIb) and 
the 'A-loop' (subdomains Vn/Vm). See Figure 1.3. 



Figure 1.3. Protein kinase catalytic site loops 
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I 


II 


ni 




(P-loop plus a lysine) = Rossmann Motif 




PKA 


LGTGS FGRVML VKHKA 


TEQYYAMKI LDKQKWKLK 


QIEHTLNEKRILQAV- 


PKC 


LGKGSFGKVMLSERKG 


TDELYAVKI LKKDWI QDD 


DVECTMVEKRVLALPG 


Src 


LGQGCFGEVWMGTWNG 


-TTRVAIKTLKPGTM 


SPEAFLQEAQVMKKL- 


InR 


LGQGSFGMVYEGNARDI IKGE 


AETRVAVKTVNE SAS LR— 


ERIEFLNEASVMKGF- 




VTb 


VII 


VIII 




Catalytic loop 


Activation segment (A-Ioop & P+l-loop) 


PKA 


DL I YRDLKPENLL IDHQG 


Y I QVTDFGFAKRVKGRT 


WTLCGTPEYLAPEII 


PKC 


GI I YRDLKLDNVMLDSEG 


HIKIADFGMCKENIWDGVTT-- 


KTFCGTPDYIAPEI I 


Src 


NYVHRDLRAANI LVGENL 


VCKVADFGLARLIEDNEYTAR- 


QGAKFPIKWTAPEAA 


InR 


KFVHRD1AARNCMVAHDF 


T VK I GDFGMTRD I YE TDYYRKG 


GKGLLPVRWMAPESL 



1.3a. The Rossmann Motif 



All kinases (including protein kinases as well as those which 
phosphorylate metabolites or lipids) contain a characteristic motif 
in their active site , called a "Rossmann Motif". 

This consists of a triad of glycines:- 
Gly.Xxx.Gly.Xxx.Xxx.Gly (Xxx^any amino acid), and a 
conserved lysine. [See;- Bossemeyer, D. (1994) TIBS, 19: 
201-205] 



The Rossmann motif is also found in non-kinase proteins 
which bind mononucleotides (ATP,GTP) and 
dinucleotides (NAD,NADP,FAD). 
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. For example the guamne nucleo«ide-bindingpro tti ns (O- 
proteins) such as 

motifs in their nucleotidettnd^sites. 



. In Harvey Ras, the sequence:- 
Gly.Ala.Gly.Gly.Val.Gly.Lys.Ser is found 

inLoop 1 between P strand 1 and the begining of ahelix 1 
(residues 10-17, 

ft. "P-Loop"). SeeFigure 1.4. In**-* thesis between 
two p- 
strands 



Figure 1.4. The Rossmann 



motif of Ras with GTPaiu^bo^ 



GLY15A 



LY12A 



1.2b. P-loop 



, Catalytic loop and Activation Segment 



-SSKSBSSSS.'S- 
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Figure 1 5 Subdomain structure of protein kinase catalytic domains 



Subdomain I contains the 'P-LOOP' 
with its glycine triad (GXGXXGX), 
which bmds the ctfiy phosphates ~ 
ATP j 



S ubdoroain II contains an invariant 
lysine which interacts with a- and 
(J-phosphates 

III IV 



PKA- 


LGTGSFGRV 


AMKIL 


PKC- LGKOSFGKV 


AVKIL 


sn- 


LGQGCFGEV 


AIKTL 


iff - 


LGQGSFGMV 


AVKTV 




Via 



VI b 





YRDL 




PEM 




TRDEi 




DDK 




HRWj 




HAS 




HRDLAAllS 



NEK 
VEK M 
QEA ' 
NEA 

~f- 

III Invariant gjutamate 



SubfomainVIbisthe 
CATALYTIC LOOP. 

It contains an invariant 
| lysine in Ser/Thi kinases 

- in Tyr kinases this is 
replaced with an aigjnine 



d 



VII/VIII 



DFGFAKRVKGRT WTLCGTPEYLAPE 

DFGMCKE N I WDGVTT — KTF CGTPD Y I APE 
DFGLARLIE DNE YTAR-QGAKF P IKWTAPE 
DFGMTRD I YE TD Y YRKGGKGLLPVRWMAPE 




TAPE \ 
MAPE ] 



Subctomains VII and VIII contain the ACTIVATION 
LOOP hounded by invariant DFG and APE motifs. 
In newly synthesised kinases a threonine (Sei/Thr 
kinase), or a tyrosine (Tyr kinases), is 
hydrogen-bonded to a serine in subdomain I - this 
blocks the active site, causing AUTO-INHIBITION 



Catalytic activity and auto-inhibition 
mechanisms 



[Hanks, S.K., Quinn, A.M. & Hunter, T. (1988) Science, 241: 42-52; Johnson, L.N., 
et al., (1996) Cell, 85: 149-158; Frankel, M, et al. (1999) Protein Science, 8: 2158- 
2165}] 



A common feature of protein kinases is that they require a 
residue in the ACTIVATION LOOP to be phosphorylated 
before they can become activated. 
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(a) Sorae protein kinases are simply controlled by 
phosphorylation and de-phosphorylation of these activation 
loop residues - examples are MAP kinase and the insulin 
receptor tyrosine kinase. 

(b) Other kinases, especially those controlled by soluble 
second messengers (e.g. PKA and PKC), are synthesised, then 
activated by autophosphorylation, whilst still being processed. 
The mature forms of PKC and PKA are phosphorylated on 
equivalent threonine residues (Thrl97 in PKA) in their 
activation loops, but then become auto-inhibited by a 
different, secondary mechanism - the binding of 
'PSEUDOSUBSTRATE SEQUENCES' to their active sites 
(see later lectures). 



Catalytic-activation loop interactions 

• Sub-domain I consensus sequence: Gly-X-GIy- 
X-X-Gly (aa's>50-55 in PKA) wraps around the 
phosphates of ATP, the amide bond nitrogens of the 
glycines providing a positively-charged electrostatic 
field which binds a and P phosphates. A serine H- 
bonds to either pseudosubstrate sequences or 
autophosphorylation sites (often found in 
subdomain VIII). 



♦ Sub-domain II contains an invariant Lys 

(corresponds to Lys72 of PKA) which binds to a 
and P-phosphates. The lysine is held in position by a 
salt-bridge with Glu9L 
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&e garmna phosphate » Serrthr 
kinases is replaced by Arg in Tyr 
kinasesXysinelc* interaction w.th the Y" 

c k Hnmain VIII represents the ACTIVATION 

'^ nT^nVnfPKA) It contains consensus 

LOOP ( 8 t 208 r f r 7p } its deletion in Src leads 

triplet: A^P^-<£^ * to subdomain are 

to an inactive ^'^^^ the activation 
often autophosphorylated as part ot tne 

mechanism (See insulin receptor). 

Activation loop blocks active site in un- 
phosphorylated form 

• 1 0-7 r»f Protein Kinase A (PKA) is H-bonded to 
PKA's protein substrates. 

substrates in its active site. 
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16 Oct 2001 Sequence Data Page 1 

Molecule: SeqID20.txt, 4081 bps DNA 
Description: 

File Name: seq id 20.cra5, dated 16 Oct 2001 

Printed: 1-4081 bps (Full), format Annotated: Enzymes, Genes 

Xcml 



1 tctagaaacc ttttgatcat aatgaaaata aagagtccat ccaccacatg 

51 gggtaagcat aatgtgtgat atttaaaggg taacaaatgt aatctgcttt 

101 ttattttact ttttacctct actqaaattg tatgggcagt tttttttttt 

151 ttttaaatga taagacaagt atctgtttaa tggtattgtg atgaaacagt 

BseSI 

201 agtaaagtca tatcgggcac gccatactac ttccacagtg gaacttggcc 

BsmBI HincII 

251 aaattttgtc tttgccgtct ctacagtttc ttccaccaaa ttttttgttg 

301 acaaaactca aatctttcaa tctcatctct gccaaagttg ggtttagaaa 

351 gaatatcagc aaacactaat atctttattg ttgcatggtt tatcaatcac 

401 aaaattcaca accattgtaa aaaaaaattc acatttttgg tatgagattg 

451 ctcacatgat agtgaacctc tttaacattt taactttact ttcataaata 

501 cgggattacg aatcttactt gcattaaaaa tttagaaaag gtttttctac 

SanDI 
PpuNfl 



EcoO!09I 



551 ttaaagaaaa aagggaccca acagagagag gtttgaccag gagaaacggg 
601 tgcatagcct taagagcttt caactacttt accccaaacc caaagcgatg 

Agel 



651 tcactttcaa ccatctcttc tctcccccga acccgttttt ttgaccggtc 



701 agttcgggca gcagcaccgt tacgggcagc ttatattcct cgtcttcctc 



SeqID20.txt Page 

SpM 

751 ctctacacca ctgcatgccc ataaataaag cccgttgaga tctttaaaaa 

801 tattaaataa tatatcaacg aaaaagctat tttattcata agaagaaaaa 

851 gagaggaaca acaacaacac actaatcata gtttctctgg caggcttgtt 

901 gttgcggctt aataaaaagc tcttttgtta ttattacttc acgtagattt 

951 tccccaaaaa gctcttattt ttttgtttaa aaaaaaaagt ttcatcttta 

1001 ttcaactttt gttttacagt gtgtgtgtga gagagagagt gtggtttgat 

1051 tgaggaaaga cgacgacgag aacgccggag aattaggatt tttattttat 

1101 tttttactct ttgtttgttt taatgctaat gggtttttaa aagggttatc 

1151 gaaaaaatga gtgagtttgt gttgaggttg tctctgtaaa gtgttaatgg 

1201 tggtgatttt cggaagttag ggttttctcg gatctgaaga gatcaaatca 

1251 agattcgaaa tttagcattg ttgtttgaaa~ tggagtcgag ttatgtggtg r < 

.............. ewm 1 » 



1301 tttatcttac jtttcactgat cttacttccg aatcattcac tgtggcttgc 
».„..™..;..„^ .......... — 



1351 ttctgctaat ttggaaggtt cgtggttact caattactca gctttactcg 
» 1 ....*..»> 



1401 tttctcaatt actttctcga ttctttttta tttggaggtg aatcgctatc 



1451 tttagtgtct gcattttgat ttatgaaaat tgttgttgtt ctttgtattt 

TatI 
Seal 

1501 gtaagattta gtggctagta ctttgaatac actgttttgc ttttcttgtt 



1551 cagatcaact ttgtatattg taaaggcatg ttctttgggt tgaaaagctg 
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EcoRV 

1601 ggttatttga tatcttaaga ttgatgttgt tgatccaaac attctctgaa 
1651 agacttcatt tgtttttggt tttgtaaaga atttgtttaa ttattagcct 

StuI 

1701 ctaatctcag agaggcctgt ttgaatagtt ctctcttgaa attagacttt 

Muni 

1751 tcaccaattg atgctaattg tgtagatttg ttgttcttgt- tataggtgj^j, 



1801 xjctttgcata. ctttgagggt tactctagtt : gatccaaaca atgtcttgca 

BamHI 



1851 gagctgggat cctacgctag tgaatccttg cacatggttc catgtcactt 

» ^ ^ . exon 2_ .>> 



1901 gcaacaacga gaacagtgtc ataagagtgt aaagctttct tctactaatc 

» «x»n2 „>>>> 



BpulOI 



BbvCI BstEII 



1951 ccacttttta aactttgacc tcagcgtggt taccgacatt tttgtttctt 

BsmI 

2001 ttgtcaaata cagtgatttg gggaatgcag agttatctgg ccatttagtt 

>»> jexon3, 0 „ 0 ,^„„ 4 ■ .,;.;» 



PpulOI 



2051 ccagagcttg gtgtgctcaa gaatttgcag tatttgtaag ttccacttat 

» , exon3 .„„»» 

Nsil 



BfrBI 



2101 gcatcatgct ttaacaaaac aaatccaaga tttgacagaa gaagcactgg 
2151 agttaccttt tgtaattgaa atctttttaa caagtttctt attttcttac 



2201 agggagcttt acagtaacaa cataactggc ccgattccta gtaatcttgg 



2251 aaatctgaca aacttagtga gtttggatct ttacttaaac agcttctccg 

fc*;;,;..;...,......^,.....; .«a>»4.,*,.„..„.„..« 
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Acclfl 

2301 gteGtattGGiLggaatpattg^ggaaagcttt .eaaagctgag ,atttctgtga 



Ndcl 



Bstll07I 



2351 gtatacatat gctttaccgg ctcagttaca gtctttgttt aatcttaggt 



2401 tttgttccaa tttttgactc tttgctgaaa attttacatg caagaata^. 



Nad 



2451 -cggcttaaca acaacagtct „caqtgggtca attcctatgt cactgaccaa 



2501 *tattactacc cttcaagtgt tgtgagtcct ctcattaact ttcatttatg 



2551 tctacttcat tctccctcag ttgatttgtt gagttaatgc acttaacctt 



2601 gatggatgca acacagagat ctatcaaata acagactctc tggttcagtt 

»».„*.♦„..»..*....>«.. .«w*6. . ;.» 



2651 cctgacaatg gctccttctc actcttcaca cccatcaggt tctatgattt 

».. M .„«wn6 . .... »» 



2701 atcctcttca gttatttcag ttgttgtgtc agtgtctgaa cttattctga 

Bsgl 

2751 aactttcatt tccttgtgca gttttgctaa taacttagac ctatgtggac 

»».....„., «. . exon , » 



2801 ctgttacaag tcacccatgt cctggatctc ccccgttttc tcctccacca 

»„.*. exotic, , 



2851 ccttttattc aacctccccc agtttccacc ccgagtaagc ctcctctttt 
;»„.„„. . . ~„y>» 



2901 tagtttacat tataggaaac agaagatgaa atctttgctt ctctgtcaat 



2951 cctttttctc atataactca tcttgccaat aaggcaataa ccaaatgatc 



SeqID20.txt 



Page 



3001 taatttgatt tcaggtaggt 



BstAPI 



BspMI 



Aarl 



PstI 



3051 ttgctgcagg tgctgctttg ctctttgctg ctqctgcaat agcctttgct 



3101 tggtggcgac gaagaaagcc actagatatt ttcttcgatg tgcctggtga 
................. »» 



3151 gtttattatt cgcattagtt tctgttctta gccagcaatt ttgttttgca 
3201 gaaaagtatt ggaacaactg ttaatgaaaa tcaatacata agtcattgtt 
3251 ttttaagtta caaactcttt tgagtaaaat ctcgattgca aaatctctat 



3301 gcagccgaag aagatccaga agttcatctg ggacagctca agaggttttc 

»» „ P .jw»5 » 



3351 tttgcgggag ctacaagtgg cgagtgatgg gtttagtaac aagaacattt 

» mm exon9 M , 



3401 tgggcagagg tgggtttggg aaagtctaca agggacgctt ggcagacgga 

» «xon9 » 

Sapl 



3451 actcttgttg ctgtcaagag actgaaggaa gagcgaactc caggtggaga 

» ....... + ..... ejton9. . , » 



Sad 



Banll 



3501 gctccagttt caaacagaag tagagatgat aagtatggca gttcatcgaa 

». . „„„.,«x°n9. M ...„>> 



BsiEI 



3551 acctgttgag attacgaggt ttctgtatga caccgaccga gagattgctt 



3601 gtgtatcctt acatggccaa tggaagtgtt gcttcgtgtc tcagaggtaa 

4 »„.,... .............. cxon 9.. t .„..,„ M . »>> 
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Btrl 

3651 aaactaaaca attaaacatc ttgtgctctc tctcaattac tttgacgtga 

3701 agtgtttttt catgttttcc tttatgggtt cataattgtt ggttacacta 

3751 atgacacaga„.gaggccaccg tcacaacctc cgcttgattg gccaacgc^g 

1<L... . ..„™)> 

Xhol 

Dpull02I 



3801 aagagaatcg cgctaggctc agctcgaggt ttgtcttacc tacatgatca 

* 4BW^i 10 . » 



3851 ctgcgatccg aagatcattc accgtgacgt aaaagcagca aacatcctct 

».„.*.„..... ,ex° n 10 » 

EcoRI 



3901 tagacgaaga attcgaagcg gttgttggag atttcgggtt ggcaaagcta 

„.«..„. t «„«.„ CXOn W 4 „ „...,*.., 4. „...,...» 

Pmfl BanI 



OIU 



3951 atggactata aagacactca cgtgacaaca gcagtccgtg gcaccatcgg 

3»„. M , „ exon 10,^ , » 



4001 tcacatcgct ccagaatatc tctcaaccgg aaaatcttca gagaaaaccg 

t t exon 10. M » 



4051 



acgttttcgg atacggaatc atgcttctag a 

» .exon 10 w4 »» 



